AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Visual-Text Conversion

# Visual-Text Conversion

Uae License Detection
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder to process document images
Image-to-Text Transformers
U
codedrainer
21
2
Donut Proto
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for image-to-text conversion
Image-to-Text Transformers
D
naver-clova-ix
30
7
Poster2plot
This is an image captioning model that generates plot descriptions from movie/TV show posters. It produces decent plot summaries, though far from perfect. We are continuously improving the model.
Image-to-Text Transformers English
P
deepklarity
15
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase